2 Exploratory Analysis
For this project, we will be using the data from from the California Housing dataset included in the scikit-learn library. The dataset contains 20,640 observations on housing prices in California. The dataset has 8 features and 1 target variable. The features are as follows:
- MedInc: Median Income
- HouseAge: Median House Age
- AveRooms: Average Number of Rooms
- AveBedrms: Average Number of Bedrooms
- Population: Population
- AveOccup: Average Occupancy
- Latitude: Latitude
- Longitude: Longitude
The target variable is:
- MedHouseVal: Median House Value